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PROBE 

The present invention relates to DNA probes for 
detecting a tandemly-repeated nucleotide sequence in the 
gene encoding mucin glycoprotein expressed by human 
mammary epithelial cells, to the use of the probe in 
diagnosis and in "fingerprinting" individuals, to the 
polypeptides expressed by the corresponding mucin gene, to 
antibodies against the polypeptides and to the use of the 
polypeptides and antibodies in the diagnosis and 
therapeutic treatment of cancer. 

Normal and malignant human mammary epithelial 
cells express high molecular weight glycoproteins (gps) 
which are extensively glycosylated and very antigenic. As 

* 

a result, many of the monoclonal antibodies (MAbs) selected 
for reactivity with human breast cancer and other 
carcinomas are found to react with molecules which are 
produced in abundance by the fully differentiated human 
mammary tissue and are found in the milk fat globule (MFG) 
and in milk. However, the level of expression of a. 
particular antigenic determinant may be different in the 
gps produced by the normal differentiated cell and in the 
similar molecules produced by breast cancers. This means 
that some antibodies can show a certain specificity for 
reacting with tumour gps. 
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The molecules bearing the epitopes recognised by 
these antibodies are complex and have been difficult to 
analyse, both because they are large and heavily 
glycosylated (>250,000 daltons) and because of the complex 
pattern of expression. Two of the MAbs, HMFG-1 and -2, 
react with a component in human milk which appears to be 
greater than 40.0/000 daltons, whereas the molecules found 
in sera and tumours are smaller, although the dominant 
components are still greater than 200,000 daltons on 
immunoblots. The large glycoprotein produced by the 
differentiated mammary epithelial cells found in human milk 
or in the milk fat globule has been purified and shown to 
have some of the characteristics of the mucins. This 
component contains a large amount of carbohydrate joined in 
O-linkage to serine and threonine residues via the linkage 
sugar N-ace ty lgalactosamine . Moreover, the core protein 
contains high levels of serine, threonine and proline and 
low levels of aromatic and sulphur containing amino acids. 

These mucin-like glycoproteins are also secreted 
by a number of other normal epithelial cells. The 
monoclonal antibody HMFG-1 is highly reactive with the milk 
mucin and evidence suggests that the epitope recognised by 
this antibody is more abundant on the fully processed 
mucin, characteristic of normal differentiation. 

In tumours, the molecular weight of the molecules 
carrying these antigenic determinants differs among 
individual tumours and, in the case of the components 
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recognised by the HMFG-2 antibody, can range from 80-400K 
daltons. Although it appears that the differences observed 
. in the mobility of the high molecular weight bands are due 
to genetic polymorphism this probably does not explain 
variations in the size of the lower bands. It has been 
proposed that these may be the result of aberrant 
processing occurring in the tumour cell possibly within the 
glycosylation pathways . 

For the majority of the monoclonal antibodies 
reacting with this group of molecules the exact nature of 
the antigenic epitopes remains unclear but circumstantial 
evidence has suggested that carbohydrate may at least be 
partly involved in many of the epitopes. Moreover, from 
previously available data it was not known whether the 
mucin found in the normal differentiated cells, and that 
observed in the tumours, contain the same core protein, or 
just carry common carbohydrate determinants. 

Mucin has now been isolated from human milk by 
affinity chromatography enabling identification of the core 
protein and the gene encoding the protein. This has been 
found to be a highly polymorphic gene defined by the peanut 
urinary mucin (PUM) locus [see Swallow et al . , Disease 
Markers , 4, 247, (1986) and Nature , 327 , 82-84 (1987)]. 
The gene product, which is hereafter referred to as human 
polymorphic epithelial mucin or HPEM, has been detected in 
breast tumours and other carcinomas as well as in some 
normal epithelial tissues. 
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It has now been found that the HPEM core protein 
has epitopes which also appear in the aberrantly processed 
gps produced by adenocarcinoma cells. Certain of these 
epitopes are not exposed in the fully processed mucin 
glycoprotein produced by the lactating mammary gland. 

In one aspect the present invention therefore 
provides an antibody against a human mucin core protein - • 

which antibody substantially does not react with a fully 
processed human mucin glycoprotein. 

As used herein the term "antibody" is intended to 
include fragments of antibodies bearing antigen binding 
sites such as the Ffab'Jj fragments. 

Antibodies according to the present invention 
react with HPEM core protein, especially as expressed by 
colon, lung, ovary and particularly breast carcinomas, but 
have reduced or no reaction with the corresponding fully 
processed HPEM. In a particular aspect the antibodies 
react with HPEM core protein but not with fully processed 
HPEM glycoprotein as produced by the normal lactating human 
mammary gland. 

Antibodies according to the present invention 
have no significant reaction with the mucin glycoproteins 
produced by pregnant or lactating mammary epithelial 
tissues but react with the mucin proteins expressed by 
mammary epithelial adenocarcinoma cells. These antibodies 
show a much reduced reaction with benign breast tumours and 
are therefore useful in the diagnosis and localisation of 
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breast cancer as well as in therapeutic methods. 

The antibodies may be used for other purposes 
including screening cell cultures for the polypeptide 
expression product of the human mammary epithelial mucin 
gene, or fragments thereof, particularly the nascent 
expression product. In this case the antibodies may 
conveniently be polyclonal or monoclonal antibodies. 

Antibodies according to the present invention may 
be produced by innoculation of suitable animals with HPEM 
core protein or a fragment thereof such as the peptides 
described below. Monoclonal antibodies are produced by the 
method of Kohler & Milstein (Nature 256 , 495-49 7/197 5) by 
immortalising spleen cells from an animal innoculated with 
the mucin core protein or a fragment thereof, usually by 
fusion with an immortal cell line (preferably a myeloma 
cell line) , of the same or a different species as the 
innoculated . animal , followed by the appropriate cloning and 
screening steps. 

In a particular aspect the present invention 
provides the monoclonal antibodies designated SM3 against 
the HPEM core protein. In another aspect the invention 
provides the hybridoma cell line which secretes the 
antibodies SM3 and has been designated HSM3. Samples of 
HSM3 have been deposited with ECACC on 7th Janu*ary 1987 
under accession number 87010701. 

Using antibodies according to the invention it 
has been possible to screen a phage library constructed 
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from mRNA isolated from a human breast cancer cell line to 
identify sequences coding for portions of the mucin core 
protein. Complementary DNA sequences have been constructed 
and from these it has surprisingly been found that the gene 
encoding the core protein contains multiple tandem repeats 
of a 60 base sequence leading to considerable polymorphism 
sufficiently extensive that cDNA fragments corresponding to 
the repeat sequence would be useful for fingerprinting DNA. 
The fingerprinting thus made possible has applications in 
for instance ascertaining whether bone marrow growth after 
transplants is from the host or the donor and in forensic 
medicine for identifying individuals using body tissues or 
fluids. 

* 

Accordingly the present invention also provides a 
nucleic acid fragment comprising at least 17 nucleotide 
bases the fragment being hybridisable with at least one of 

a) the DNA sequence 

5 ' 

ACC GTG GGC TGG 
GGC CGG CCT GGT 
ACC GTG GGC TGG 
GGC CGG CCT GGT 

b) DNA compl ementary 

GTC ACC TCG GCC 
CCG GGC TCC ACC 
GTC ACC TCG GCC 
CCG GGC TCC ACC 



* 

GGG GGC GGT GGA GCC CGG- 

GTC CGG GGC CGA GGT GAC- 

GGG GGC GGT GGA GCC CGG- 

3' 

GTC CGG GGC CGA GGT GAC 
to the DNA of a) , i.e. of sequence 
CCG GAC ACC AGG CCG GCC- 
GCC CCC CCA GCC CAC GGT- 

ccg gac acc agg ccg Gee- 
s' 

GCC CCC CCA GCC CAC GGT 
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c) RNA having a sequence corresponding to the DNA sequence 
of a) and 

d) RNA having a sequence corresponding to the complementary 
DNA sequence of b) . 

The sequences in (a) and (b) each include a double 
tandem repeat sequence of 120 bases. Fragments according to 
the invention may correspond to any portion of this sequence 
including portions bridging the start point of the repeat. 

Fragments according to the invention will 
hybridise under conditions of low stringency with the DNA 
and RNA sequences (a) to (d) above. Preferred fragments are 
those which also hybridise under conditions of high 
stringency. The most preferred fragments of the invention 
are those which have sequences exactly identical to r or 
exactly complementary to the sequences (a) to (d) above. 

Normally a given DNA or RNA fragment according to 
the invention will be capable of hybridising with both DNA 
according to a) and RNA according to c) or with both DNA 
according to b) and RNA according to d) above. 

Preferably the nucleic acid fragment according to 
the present invention will co: -rise a portion of at least 30 
nucleotide bases capable of hybridising with at least one of 

a) to d) above, more preferably at least 50 such bases and 
most preferably the fragment contains a sequence of 60 bases 
exactly complementary to one of the repeat sequences of a) , 

b) c) or d) above. Other fragments of the invention may • 
comprise two or more repeats of such a sequence, optionally 
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with minor variations by way of substitution. Preferably 
such fragments include an integral number of such repeat 
sequences. Further fragments of the invention comprise the 
tandem repeat sequence and additional coding or non-coding 
5 1 and/or 3' flanking sequences corresponding to the HPEM 
gene or a portion thereof. 

When the existence of a tandem repeat sequence was 
first identified it was believed that the sequence consisted 
of 59 base pairs corresponding with the sequences indicated 
in (a) and (b) above except for the lack of the base 
indicated with "*". 

Many fragments according to the invention as 
originally defined in British Patent Application No. 8700269 
also conform with the new definition of fragments as set out 
herein and those fragments of sequences defined under (a) , 
(b) , (c) or (d) above which do not include the bases marked 
n *" form a particular aspect of the present invention. Such 
fragments have sequences corresponding to at least a portion 
of the sequences 

a 1 ) GTG GGC TGG GGG GGC GGT GGA GGC 

a 1 ») CGG GGC CGG CCT GGT GTC CGG GGC CGA GGT GAC AC 

b 1 ) DNA complementary to the sequence of a 1 ) or a 11 ), 

c 1 ) RNA having a sequence corresponding to the sequence of 

a 1 ) or a ' ' ) and 
d 1 ) RNA having a sequence corresponding to one of the 

complementary DNA sequences of b 1 ) 

In the human genome the DNA tandem repeat sequence 
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comprises antiparallel double stranded DNA, one strand 
.having sequence (a) and being paired with a strand having 
sequence (b) . 

As mentioned above the nucleic acid fragments of 
the invention may be used as a probe for detecting one or 
other strand of the DNA tandem repeat sequence in the human 
genome, or RNA transcribed from either strand and hence for 
identifying the gene or genes for human mucin core proteins, 
mRNA transcribed therefrom and complementary DNA and RNA • 
For such purposes it may be convenient to use the complete 
normal gene comprising at least one tandem repeat sequence, 
or mRNA transcribed therefrom or to attach non-complementary 
fragments to either or both the 5' and 3 1 ends of a fragment 
according to the invention and/or to attach detectable 
labels (such as radioisotopes, fluorescent or enzyme labels) 
to the probe or to bind the probe to a solid support. All 
of these may be achieved by conventional methods and the 
nucleic acid fragments of the invention may be produced de 
novo by conventional nucleic acid synthesis techniques. 

+ m 

The nucleic acid fragments of the present 
invention may also be used in active immunisation 
techniques. In such methods the fragment codes for a 
polypeptide chain substantially identical to a portion of 
the mucin core protein and may be extended at either or both 
the 5 1 and 3' ends with further coding or non-coding nucleic 
acid sequences including regulatory and promoter sequences, 
marker sequences and splicing or ligating sites. Coding 
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sequences may code for corresponding portions of the mucin 
core protein chain or for other polypeptide chains. The 
fragment according to the invention, together with any 
necessary or desirable flanking sequences is inserted, in an 
appropriate open reading frame register, into a suitable 
vector such as a plasmid or a viral genome (for instance 
vaccinia virus genome) and is then expressed as a 
polypeptide product by conventional techniques. In one 
aspect the polypeptide product may be produced by culturing 
appropriate cells transformed with a vector, harvested and 
used as an immunogen to induce active immunity against the 
mucin core protein. In another aspect the vector, 
particularly in the form of a virus, may be directly 
innoculated into a human or animal to be immunised. The 
vector then directs expression of the polypeptide jLn vivo 
and this in turn serves as an immunogen to induce active 
immunity against the mucin core protein. 

The invention therefore provides nucleic acid 
fragments as hereinbefore defined for use in methods of 
treatment of the human or animal body by surgery or therapy 
and in diagnostic methods practised on the human or animal 
body. The invention also provides such methods for - 
treatment of the human or animal body by surgery or therapy 
and diagnostic methods practised in vivo as well as ex vivo 
and jtn vitro . 

The invention further provides a polypeptide 
comprising a series of residues encoded by the DNA tandem 
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repeat sequence, 



the sequence shown at (b) above being the 



coding sequence. 



Polypeptides according to the invention 



are selected from any of those having 5 or more amino acid 
residues represented by the iollowing amino acid sequence 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr 
Ala Pro Pro Ala His Gly*Val Thr Ser Ala Pro Asp Thr Arg 
Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly 
("*" marks the start of the repeat in the peptide) . 



Polypeptides according to the invention may have 
a sequence corresponding with any portion of the 40 
residue sequence above and may include the start point of 
the repeat sequence. 

Other polypeptides according to the invention 
include three or more repeats of the 20 amino acid repeat 
sequence. Such polypeptides may include minor variations 
by way of substitution of individual amino acid residues. 

The invention further provides polypeptides as 
defined above modified by addition of N-acetyl 
galactosamine (a linkage sugar) on serine and/or threonine 
residues and by addition of oligosaccharide moieties to 
that or via other linkage sugars and/or fragments linked 
to carrier proteins such as keyhole limpet haemocyanin, 
albumen or thyr oglobul in . 

Preferably the polypeptide comprises at least 10 
amino acid residues of the sequence above, more preferably 
20 residues. The polypeptide may comprise the full 
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sequence above. Such polypeptides may further comprise 
additional amino acid residues, preferably conforming to 
the amino acid sequence of HPEM core protein. 

In a further aspect the present invention 
provides the HPEM core protein. This is encoded by the 
PUM gene and may be produced by recombinant DNA techniques 
and expressed without glycosylat ion in human or non-human 
cells. Alternatively it may be obtained by s tr ipping 
carbohydrate from native human mucin glycoprotein which 
itself may be "produced by isolation from samples of human 

^\ tissue or body fluids or by expression and full processing 
in a human cell line. The HPEM core protein may be used 

] s if for raising antibodies in animals for use in passive 

immunisation, diagnostic tests and tumour localisation and 

j s =f in active immunisation of humans. 

^ The invention further provides antibodies 

□ (monoclonal or polyclonal) , and fragments thereof, against 
any of the polypeptides described above. Such antibodies 
may be obtained by conventional methods and are useful in 
diagnostic and therapeutic applications. 

The invention further provides antibodies 
(monoclonal or polyclonal), or fragments thereof, linked 
to therapeutically or diagnost ically effective ligands. 
For therapeutic use of the antibodies the ligands are 
lethal agents to be deli ve red to cancerous breast or other 
tissue in order to incapacitate or kill transformed cells. 
Lethal agents include toxins, r ad ioiso topes and 'direct 
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•killing agents' such as components of complement as well 
as cytotoxic or other drugs. Further therapeutic uses of 
the antibodies inclusive passive immunisation • 

The invention further provides therapeutic 
methods comprising the administration of effective 
non- toxic amounts of such antibodies or fragments thereof 
and antibodies or fragments thereof for use in therapeutic 
treatment of the human or animal body. Especially in 
therapeutic applications it may be appropriate, to modify 
the antibody by coupling the Fab region thereof to the Fc 
region of antibodies derived from the species to be 
treated (e.g. such that the- Fab region of mouse monoclonal 
antibodies may be administered with a human Fc region to 
avoid immune response by a human patient) or in order to 
vary the isotype of the antibody. 

In the diagnostic field the antibodies may be 
linked to ligands such as solid supports and detectable 
labels such as enzyme labels, chromophores and 
fluorophores as well as radioisotopes and other directly 
or indirectly detectable labels. Preferably monoclonal 
antibodies or fragments thereof are used in diagnosis. 

The invention further provides a diagnostic 
assay method comprising contacting a sample suspected to 
contain abnormal human mucin glycoproteins with an 
antibody as defined above. Such methods i-nclude tumour 
localisation involving administration to the patient of 
the antibody or fragment thereof bearing a detectable 
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label or of an antibody or fragment thereof and, 
separately simultaneously or sequentially in either order 
a labelling entity capable of selectively binding the 
antibody or fragment thereof. The invention also provides 
antibodies or fragments thereof for use in diagnostic 
methods practised on the human or animal body. 

Particular uses of the antibodies include 
diagnostic assays for detecting and/or assessing the 
severity of breast, ovary and lung cancers. 

Diagnostic test kits are provided for use in 
diagnostic assays and comprise antibody or a fragment 
thereof, optionally suitable labels and other reagents 
and, especially for use in competitive assays, standard 
sera. 

The invention will now be illustrated by the 
following Examples and with reference to the figures of 
the accompanying drawings in which 
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JifiT* Legends 



ripUre 1; Purification of the ailk «ycin by taw o affinity 
; hroaatotraphy win* the antibody Jfl£7>3-1. Kilks f r0B ,. Ttral 

indiridual. «r. oosbined and absorbed to * H^G-1-S.pharo.e column as 
described 1b Method,. 3*. wt ,riU .lutin* at lo. pH ™ iodinated 
and .ubj.ct.d to FAGS electrophoresis and autoradiography (track 1). 
She iodinated «terial ,a, precipitated „in* the Protein A .ethod 
»itn antibodies (track 5). (track 2). BT254 (track 3) 

and RPiil 4 20J ICS (track 4). 

Ti^re_2: Cczpariscn of the 125 I labelled purified nilk «cin nth 
inkblot of te« ski=ed nilk. A, hunan ski=ed *ilk .as subjected 
to EDS polyacrylanide electrophoresis . transferred to nitrocellulose 
paper, the blot probed .ith the monoclonal antibody HCG.-j and binding 
detected using an ZLISA »ethod. B, after purification cn an »£?G-1 
affinity eolu=n foiled by 075 Sephadex chro=ato C raphy the nilk eu cin 
»*s iodinated by the Eolton and Hunter method and subjected to SDS 
polyaerylanide electrophoresis and autoradiography . 

?£r " re S l ^oradioeraphy of the iodinated tilk «ucin after treatment 
»±t* iydroten fluoride. 3he purified .ilk «ucin .«a treated »ith XT 
3 Lour, at rooa tecperature (track 1) or 1 Jbour at 4*C (track 2) 
*nd the resulting preparationj .ere then iodinated and rxm on SDS 
polyaerylanida eels. 
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riFUre * ' "•""Tlty of tfae'intact, partially .tripped or extensively 
•tripped ailk aucin with iodlnat.d lectins. The purifi d intact silk 
aucin (track 1). the aucin treated with HP for 1 Lour at 4'C (track 2) 
•ad the aucin treated for 5 hour, at rooa temperature (track 5) were 
•ubjected to SDS.polyacryl.2ide electrophor.si. and then transferred 
to aitroc.llulo.e paper. She piper was then probed with 125 I PKA 
(peanut agglutinin) , «5j WGA ' (wheat g.ra agglutinin), or 125 I EPA 
(Helix pomatia agglutinin). 



Iirure_5: I=ur.cpre = ipit £ tion and ir=unohlots of the partially ar.d 
° '^Zl±^lJ±Z±E?2±^^ the 125 J extensively .tripped aucir. 
was i=unoprecipitEt>ed with Stf-3 (track 3), K^G-2 (track 2) or K32 
redius as a control (track 1) by the protein A plate aethod (see 
Material, and . Ifethods ) . B, the partially .tripped true in (track 1) or 
extensively .tripped aucin (track 2) was run on SDS polyacrylaaide 
Cel. and tran.ferred t,o nitrocellulose paper. The blot was then 
^reacted with a- cocktail of and SK-S aonoclonal antibodies and the 

binding detected using an Z1ISA aethod. 



ri£Hre__§: Reactivity of aonoclonal antibodies SK-3 and HL7G-2 with 
■ethacara fixed br.a.t tissue and tuaour lection, u.in* an indirect 
icsunoperoxidaae .tainin* aethod. Infiltrating ductal carcinooa 
showing atron* reactivity with both (A) and (B) 

Fibroadenoaa ahowin* ao reactivity with EW-3 (C) and stroii* 
leterogen.ou. .tainlng of the epitheliua with lQ£7G-2 (D). Papilloaa 
•howinx T.ry weak reactivity with Qf-3 (Z) and strong po.itivity with 
KWG-2 (F) . Both noraal re.ting brea.t (G) and lactating breast (I) 
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. M .tiT.V n .t.in.a .Uh SK-J. „ h , r .., boti tl .. u . i 

positively »ith with lactatiae br...t /,» v . 

*ncwin{ ox-cast (J) mich «tron£er tia.n 

aoraal re«tix>* breast (H) . 
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Figure Legends 



figure 7. Periodic acid-silver stained milk mucin after antibody 
affinity column and gel filtration column. Milk mucin was purified on 
an HMFG-1 antibody affinity column (lane 1) followed by passage 
through a G75 Sephadez column (lane 2), subjected to 
JiaDodSO^/polyaerylaxide gel electrophoresis, and silver stained 
following treatment of gels with 0.2% periodic acid. 

Figure p. Silver stain of partially and totally stripped core 
protein from milk mucin. The purified milk mucin was deglycosylated 
by treatment with anhydrous hydrogen fluoride for 1 hr at 0*C (lane 1) 
and 3 hr at room temperature (lane 2), separated by electrophoresis 
through a NaDodSO^/polyacrylamide gel (10%) and silver stained. 

Figure q . Immunoprecipitation with MAbs of in vitro translated 
protein products from MCF-7 poly(A)* RNA. Poly(A) + RNA from HCF-7 
cells was translated in a rabbit reticulocyte lysate system (Aaershaa) 
in the presence of [ w ^S]methionine .(1 000 Ci/ssnole; ICi = 37 GBq) 
following the mamif actnrer •a conditions. Samples containing 5x10* 
acid precipitable cpm were precipitated with MAbs SU-4 (lane a), SH-3 
(lane b) , Hl£FG-2 (lane c), H£FG-1 (lane d) and an irrelevant MAb to 
interferon (lane e f 24 ) 9 separated on a MaDodSO^/polyaerylamide gel 
(10%), impregnated with Amplify and exposed to XAR-5 film at -70 # C for 
20 days. 
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Figur 1Q. Insuaoblot analysis of fusion proteinB froa the JLsuc 
clones. The phage clones S, 4, 6, 7, 8, 9 and 10 were used to 

lysogenize bacterial atrain I 1089. Imogens were grown at 32*C, 
■hifted to 42 and then induced with IPTG. Imogen proteins were 
fractionated by electrophoresis through a NaDodSO /polyaerylamide gel 
(7.5JC). transferred to aitrocelluloBe , and reached with a£FG-2. The 
binding was detected with an ELISA oethod using 4-chloro-1 -oaphthol as 
the aubstrate. The numbera are those of the X clones. 



■Fipurell. .Jiybridization of pVUClO to cDKA inserts of p«?C clones. 
ENA from the plasmid clones was digested with restriction enzyme ZcoRI 
to excise the cDNA inserts, separated by «2ectrophoresiB on 1.4% 
agarose and transferred to Biodyne nylon membrane. The filter was 
hybridized using standard conditions (St) *o the insert from piTJClO 
which was labelled with (a- 52 p]dCTP by the method of random priming 
(41). Lanes: plaBsid clones 3,4.6.7.8,9.10. 



ucm 



Figure 12. RNA blot hybridization analysis of masaary breaBt e; 
eRNA. 10. pg of total RKA from human breast cancer cells UCF-7 (lane 

1) and T47D (lane 2), normal human masmary epithelial cells Hul£E (lane 

* . 

S), human embryonic fibroblast. ICR? 23 (lane 4), Daudi cella (lane 5) 
and carcinosarcoma HS578T cells (lane 6) were separated in a 1.3% 
agaroee/glyoxal gel, blotted on to nitrocellialose and hybridised to 
the plTOCIO ZcoRI insert which was labelled with [c-^pJdCTP by the 
method of random priming (41). The size markers were 28S (5.4 kb) and 
18S (2.1kb) rRWAa. 
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Zi£ureJL2- Polymorphic human DKA fragments detected by 
hybridixation with pUUC1 0 probe. Genomic. DNA samples prepared from 
the white blood cells from ten individuals (six unrelated) and from 
thr e cell lines were digested to completion with Hinfl and ZcoRI, 
fractionated by electrphoresis through 0.7% and 0.6* agarose, 
respectively, and tranaferred to Biodyne nylon ■embranes. The filter 
was hybridixed to the pKUCIO DNA insert which was labelled with 
fa p]dCTP by the method- of random priming (41). X-ray film was 
•xposed for 1 day at -70'C with intensifying screens. Lanes 1-4 
father, two daughters and mother, lanes 5-10 unrelated individuals, 
lane 11 is MCP-7, lane 12 is 2R75-1, lane 13 is. 1CRT-23. The DNA 
sa=ples exhibit a wide distribution of sixes. Numbers indicate length 
of DNA in kb. The apparent bands at 23Zb are in lanes 12 and 13 are 
artefacts introduced in autoradiography 
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ExamDle 1 



Purification of the cllk B»Hn 



The cilk cucia was purified froa huaan akimaed silk by passage 
through an HK?G-1 affinity coluan followed by aixe exclusion 
ehxoaatography. Ihe »£7G-1 ssonoclonal antibody was purified from 
tissue culture aupernataat using a protein A eolusn (1). Use purified 
antibody was coupled to cyaaof en broside aetirated eephareee 
(Pharmacia) as deBcribed in the nanuf acturer «s inetructions . Human 
skinned cilk was passed in batches of 100 ad through the antibody 
cenu=n followed by extensive washing with TBS.' "Bound tcti t cr. was 
eluted fro= tht colunn uxing 0.1 K glycine pK 2.5 and the fractions 
registering an optical density at 2E0na were pooled, dialysed against 
0;25 if acetic acid and lyophilixed. Batches of about 20 cgs were 
dissolved in 0.25 il acetic acid and passed through a G75 Scphadex 
colu=n (1 x 100 cb) which had beon previously equiligrated with acetic 
acid. The colu=n was washed with 0.25 U acetic acid and Inl fractions 
collected. The peak fractions which were eluted in the void voiuae 
were pooled, lyophilixed and the dry powdor atored at 4'C. /aino acid 
analysis was performed using a Beckaan 6S00 aaino acid analyser. 



PeglTCosylatlon of the cllk cucin 

To reaove the O-linked carbohydrate froa the nilk anscin the 
■olecule -was treated with anhydrous hydrogen fluoride aa described by 
Kort and I*aport f22) . for either 1 hour at 4*C which produced a 
partially stripped preparation, or 5 houra at rooo terrperature which 
produced the extemirely atripped nucic. 
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lodination of the cilt nuein 

lodinations of the purified aucin, the partially or extensively 
■tripped aucin were carried out naing the Bolton and Hunter .ethod 
(51). Briefly, the aucin, 2.5 pg in 20 |il 0.1H borate buffer pH 8.5. 
w " added t0 tt » dri « d Bolton and Hunter reagent (1 »Ci, laershaa 
International pic) and incubated at pooa temperature for 15 ainutea. 
Ihe reaction was stopped by the addition ef 0.5 al of 0.2H glycine in 
borate buffer and after a further 15 ainutea incubation, free Bolton 
and Hunter reagent was renoved by pas a age through a G25 Sephadex 
column (PD10 eoluMB, Ffcanaacia) previously equilibriated in PBS. 



lodination of Lectins 



Wheat gero agglutinin (WGA), peanut agglutinin (PJiA) (Tector Labs) 
and Helix pomatia agglutinin (HPA) (Boehringer) were iodinated as 
described by larlsson et al. (52) using the chloraaine T aethod. 



Pol vacrvl acid e K c1b and Western biota 

Polyacrylaside gel electrophoresis and iaaunoblotting was 
performed as described previously (1 ) . Briefly, aaaples were run on 
5-1 S% polyacrylanide gels and then alectrophoretically transferred to 
nitrocellulose paper (Schleicher and Schuell) at 50 rolts overnight 
*t 4*C (36). In the iaaunoblotting experiments the paper was reacted 
with aonoclonal antibodies and binding detected with an ZLISA aethod 
wing 4-chloro-1-naphthol as the substrate. For l.ctin binding 
■tudies the Western blots wars reacted with the iodinsted lectins aa 
described by Swallow mt al. 68). 
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oduction of aonoclonal *r>tihnA< »m 



A female BALB/e mouae was iaaunixed with 5 »g of the partially 
■ tripped milk mucin la rr.und'a complete adjutant and S .oath, later 
boosted with a further 5 ug of the .ase preparation ia Preund's 



iacoaplste adjuvant. After a further 20 days, 5 « of the arocin 

ir.ly .tripped of it. carbohydrate was given intravenously ia 
e aolutioa. the spleen was reaoved 4 dayB later, and fused with 
the NS2 souse siyelooa cell line (53). 



Screening of hvbrido= B e-jpereatar.t and i=u.noT>reciT>lt„tj „ 



Tne screening aaaay tab a modification- of that described by Jielero 
and Gonzalez-Rodriguez (54^. ahUtiwell plates were coated with 50 ul 
ef 0.1 mg/ml protein A (Pharmacia Pina Cheaicals) ia PBS aad allowed 
to dry overnight at 57 'C. Ihe plates were blocked with 5% BSA for 1 
hour, at S7'C followed by the additioa of 50 yl of rabbit anti-mouse 
immunoglobulin (DAXO, diluted 1:10 ia PBS /BSA - PBS/BSA). After 
iacubating for 2 hours at 57*C the plates were washed twice with PBS 
containing 1* BSA and 50 ,il of hybridoaa supernatant added. Ihe 
plates were incubated overnight at 4*C, washed twice with PBS/BSA and 
50 K l of iodiaated partially .tripped srocin coataiaiag 100.000 cpa 
added to each well. 5Sie plates wars then incubated at rooa 
teaperature for 4 hours , washed 4 timea with PBS/BSA and the 
individual wells eounted in a gamma counter. For iaaunopr.cipitation 
•xperiaents 50 al of SDS saapl. buffer containing dithiothr.itol was 
*dded to each of the wells which wars then boiled for 3 .inutes and 
the buffer loaded onto 5-1 5% polyacryl aside gradient gel 8 . 
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Staining cf tissue sections 

Tissues from primary masnary carcinomas, benign breast biopsies, 
normal breast, and pregnant lactating breast tissue were fixed in 
■ethaeara (methanol chloroform and acetic acid 60:30:10) and ambedded 
into paraffin wax. Sections were stained with the antibodies using 
the indirect peroxidase anti peroxidase method as previously described 
(47). 

Results 

Purification of the milk mucin 

The milk mucin was purified from human skimmed milk on an HkEFG-1 
antibody affinity column. lodination of the eluted material revealed 
the presence of a large molecular weight component and a 68ZD band. 
Precipitation of the affinity purified material with antibodies H£?G-1 
and »£?G-2 (tracks 2 and 5) followed by gel electrophoresis showed 
that both the high molecular weight components and the 6BZD component 
were precipitated by both antibodies (less effectively by H1EPG-2). 
Since the 6 SID component was also precipitated by two unrelated 
antibodies (figure 1, tracks 3 and 4) and this component was mot 
wrident on an iasunoblot of the purified material reacted with HM7G-1 
(figure 2A), the 681 component was removed by molecular sieve 
chromatography on a 075 column. The final purified product showed a 
major high molecular weight band, with only a trac e of the 6BX 
component and a minor contaminant around 141 (figure 2B). 
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A high aolecular weight glycoprotein (PAS-O) containing aore than 
50% carbohydrate in O-liakag. ha. been purified fro. the human milk 
fat globule by Sbimixu and lamauchi (8). To ... whether this 
eoaponent wa. .iailar to the mucin i.ol.ted from ailk by affinity 
chromatography en an tt£? 0 -i affinity column, the amino acid 
eoapo.ition of the purified jars-l reactive mucin was determined and 
eoapaxed to the amino acid eoapoaition of the purified FAS-0 
component. Table 1 .hows that there i. good corre.pondence between 
the two .ets of data, indicating that the core proteins cf PAS-0 and 
the mucin purified here are the »aae. \ " 



leolation of the core protein nf the milt mri* 

Aa there are no enzymes eaaily available that are efficient at 

—111 flaked «ugar. , aad 0 elimination often result, in damage to 
the^roteinjso re , the oligoa^ echaridea were t^T^Tt^SSt"^ 
t he mucin witTanIyd7o^"h7dTo^ fluorid e . ~W^reatment has be» 



by Mart and Lamport (211 to be effective in removing .ugar. from 
Pig .ubmaxillary auc in without damaging the protein core. Amino acid 
analyai. of the material produced after HP treatment cf the milk fflucin 
•ugge.fd that the protein cor. wa. al.o in thi.. e a.e «nda»ag.d, ,ince 
the coapoaition was the .ame aa that in the intact mucin (Table 

laitially the ailk mucin waa exposed to HP for only 1 hour at 4\ 
but analy.i. of the produet .howed that there wa. only partial removal 
cf the rugar. with «uch treatment, and it wa. aec.a.ary to treat the 
m:cin at room teaper.ture for 3 hours to obtain a aol.cule which 
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showed no lectii^Einding ability, figure 3 showB an autoradiograph of 

the iodinated products after treatment for 1 hour at 4* (track 2) or 3 

hours at RT (track 1). It can be seen from figure 3 that the milder 

treatment results in a fixture of products cade tip of high molecular 

weight aaterial which is slightly acaller than the intact mucin a 

scber of smaller bande. After longer exposure to H7 at roon 

temperature , the high molecular weight bands disappeared resulting in 

polypeptide bandB of about 6 BED and 7 2 ED. 

To test for the presence of sugars on the iattct s:ucin and on the 

products produced after the two different KF treatments each 

preparation was subjected to acrylanide gel elccxrophsre sis , 

12^ 

transferred to nitrocellulose paper ar,d react*a vitb "3-labtlled 
lectins. The lectins used were petnut lectin (PNA) vfcich reacts with 
galactose linked to K-acetyl galactosanine f whett germ (*GA) reactive 
with K-aeetyl glucosamine and Helix pocatia agglutinin (KPA) which 
reacts with the linkage sugar in O-linked glycoeyletion, 

* 

^'-acetylgalactosamine, figure 4 chows autoradiographs of the reacted 
blots, and it can be seen that while treatment with K? for 1 hr at 4* 
(track 2) altera the lectin reactivity of the true in, carbohydrate is 
still present. Interestingly! however, there is a such lover level of 
binding of PKA to the high molecular weight material of the partially 
■ tripped nrucin than is seen with the intact mucin (track 1). 
Moreover, this loss in PXA binding ability ia accompanied by binding 
of the linkage sugar specific lectin HP A. This lectin shows no 
binding at all to the intact mucin, and the changed pattern of lectin 
binding after limited treatment with E7 indicates that sugars masking 
the 0-linked N-acetylgalactosanine have been stripped off. The 
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' «=all.r corpone^een ^ both the intact rucin^.ck 1} ^ la ^ 
partially .tripped preparation (track 2) i. « apoprotein which 
r.act. with »SA. although not with PJJA. Thi. »ay correspond to the 
component of .isilar .olecular weight (around 681) ...n after affinity 
chxo=atography of the rucia and Bay repr ., eat „ i^teroediate 
precurior aoltcule. 

Jigure 4 .hows dearly that tie £61 and 721 conpon.nts produced 
after extensive tr.atnent with ». (5 hr at XT), .how »o peactirity 
with the lectins (track 3), including the ^'-acetylgalactosamine 



specific lectin HP.A. 1Mb enervation constitute, .trong evidence 
that the «ugar« have bean re: ? ve:froin at least the najority of the 

eolecules, #->d we will ***** <vj_ ^, ~ 

wixi refer to this preparation aa the extensively 

stripped nueia. 



Generation of eor.oclonpT zr.tih a ii..\ 0 the ellfc g ., ein eBr « ? „,+ .<„ 

A fusion was carried out using the .pleen of a aouae that had been 
i=unized with two injections of the partially .tripped eilk »ucin 
follo^d by a boost with the extensively .tripped micin. The clones 
»ere initially screened against the 125 I partially .tripped naterial 
using protein A plate, (aee tothoda). Jcur hybridona. were .elected 
and cloned, and table 2 .how. their .pectna of reactivity with the 
intact, partially and .xtensively .tripped rmcia. Am can be .e.n fro= 
thi. tabl. three cf the hybridoma. which wer. i.olat.d .bowed a .trong 
reaction with the partially and .xtensively .tripped smein and did not 
r.act with the intact «ucin. Ihe.e appeared to be good candidate, for 
■oaocleaal antibodie. to the protein core and two, EM-3 and 
■elected to be characterised further. 



t were 
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It an al e ae8a froa table 2 thAt ^ g-1 and HM?G-2 
antibodie. reacted very .trongly with the mucin .tripped of its 

arbohydrate. The., two antibodies were, la fact, developed usiag the 
intact ameia (tram the milk fat globule) aa immunog.a and. ia the case 



of SkTG-2. frowiag mamaary .pith.lial cell. (14). Ih.ir reaction with 
the .tripped mucia wxa unexpected, aa circumstantial wvideaee had 
prerioualy led to the belief that carbohydrate might form at least 
part of their antigenic epitope*. 



Molecular Weight of C ol eeu l« B e ,rn y lng antigenic detemt^^. 

The antibody ai-S was ahowa to be of the IgGl .ubdass, while the 
SM-4 antibody was found to be Igk. We therefore choae to tiae the SH-3 
antibody in .ubsequeat experiments aince aatibodiea of the Iglf class 
can present problems in aoae appliction. Immuaoprecipitatioa of the 
extensively .tripped material with EM-S ahowed a reaction with the 
lectin unreactive 681 cocponent (Figure 5A, track 3). She monoclonal 
antibody HV£?G-2 can al.o be .een to immune precipitate the 
lectin-unreactive 6BZ coaponent (track 2). She antibodies were 
reactive with antigen on iamunoblote and Wgure 5 B .how. the reaction 
of antibody Si-5 with the dominant 681 band of the exteaaively 
•tripped mucin (track 2). 

4 ^ 

We have previoualy ahown that the molecular weight of the 
component, la brea.t cancer cell, carrying determinant, found en the 
■ilk mucin i. lower than 4001 and can vary from one tumour to another 
(1). Reaction of antibody SU-S with' Western blot, of gel ..parated 
extraeta of breast tumour cell, .hows that thi. antibody react, with 
component, of .imilar molecular weight to tho.e reactive with antibody 
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HK7G-2 (data n»£hown). Because the antibody 3 differ. fre= the 
antibodies HM?G-1 and 2 in that it doea not react with tie Intact 
mucin proceed by the lectating gland and yet reacta with molecules 
processed by breaat cancer cells . it via appropriate to axaaine the 
reaction of SM-3 with a range of breast cancer.. 



ours 



seen 



Reactivity of SM-S with hrmnm t tl^ti— . n A 

The antibody SU-3 reacted with paraffin embedded tissues provided 
these were fixed in methaearn (not foraal aaline). Using this method 
for preparation of tissue sections, the reaction of the antibody was 
compared to that of KM7G-2 on breast ti.aues and tuaours with an 
indirect iaaunoperoxidaae staining method, die analysis showed a 
draaatie difference in the staining pattern of SH-3 compared to that 
with affG-2. Ifaua, although a strong positive reaction was seen 
In 20/22 breast cancers stained with SM-3 (as coapared to 22/22 
stained with lfl£F&-2), normal resting breast, pregnant or lactating 
tissues and aost benign lesions were largely tmatained with SU-3 but 
were stained with Hi£FG-2. Soae exaaplea of staining patterns of breast 
tissues and tuaours are illustrted in Pigure 6. 

Twenty-two priaaxy carcinomas and fourteen benign lesions were 
•xaained and the reaction of SM-3 compared to the ataining with HlffG-2 
In each ease. In the primary carcinomas, staining with SM-3 was 
heterogeneous but (en.rally quit, atron* and alway. confined to tuaour 
eslls; connective tissue and atroma shows d no reaction (see figures 
6A.B). In the four fibroadenomas .xaained. ataining cf the apitheliun 
with BH7G-2 was strong although heterogeneous. In comparison, 
staining with SM-3 was negative in one ease and in the three othera 
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staining was coined to only one or two glandS^ lements. H£»3. 2 
showed strong positivity on the fire papillomas and five eases of 
eystic disease studied while the staining observed with EM-3 was very 
■uch weaker and sore heterogeneous (figures 6G.H). She papillomas „ 
. a group showed the strongest staining with SU-3, and it ean be aeen 
that the staining was membranous or extracellular. 

la contrast to BffG-1 and H£?G-2 which .trolly .tain laetating 
and pregnant breast. Qf-3 was totally negative with three out of six 
cases of pregnant or laetating breast (aee figure 6C and D). Two 
positive cases ahowed only very weak staining of an occasional cell 
and in the third, staining was confined to two areas of one lobule. 
.Again, in contrast to »£?G-1 and HJCG-2 which do react with some 
terminal ductal lobular units of normal, resting breast (albeit 
weakly) , SM-3 was totally negative on eight out of the thirteen eases 
tested and in the other fire cases staining was axtreaely weak and 
often confined to one or two acini in the tissue aection (see figure 
6E end T) . It ahould perhaps be noted that the intensity of staining 
with Hl£7G-2 aeen with normal breast tissues and benign lesions fixed 
in aethacarn was aomewhat higher than that reported previously using 
formalin fixed material (50,4"3. 

SM-3 was also shown to be negative on aeetions of normal livar, 
lung, thymus, aweat gland, apididymus. prostate, bladder, small 
intestine, large intestine, appendix, thyroid and skin. Ibe antibody 
showed weak positive staining only with the distal tubules of the 
kidney, the occasional chief cell of the stomach, the occasional duct 
cell of the salivary gland and the sebaceous gland. 
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large aolecular weight sue in aolecules are expressed by s;any 
carcinomas and carry mtnj of the tuaour assoeisted antigenic 
determinants recognised by eonoclonal antibodies. Ihese epitopes cay 
also be expressed by iooa aored epithelium, and soae nonoclonal 
antibodies like 2D£?G-1 react particularly wrll with a aucin found in 
normal huaan cilk (1,17). JLb Ion* as the study of the mucins is 
restricted to their detection with antibodies reactive with undefined 
epitopfs, the knowledge of their structure, expression and precising 
w-ill also be restricted. Te have begun to investigate the structure 
and expression of the na=try nucin by isolating the core protein and 
developing antibodies which have allowed us to select partial cDXA 
clones for the gene coding for the core protein . This Example 
describes the production and characterixation of these antibodies. 

Treataent of the affinity purified milk cucin vith hydrogen 

fluoride resulted in the appearance of a dominant band of about 6BI 
dal tons .and a cinor species of about 72U) on SDS acrylaaide gels. 
Theae bands ahowed no reactivity with lectins, including Helix ponatia 
agglutinin which is specific for K-acetyl galactosaaine , the first 
augar in O-linked g lycosylation (55). it therefore seems probable 
that this 6BX dalton polypeptide represents the core protein of the 
xucin. Supportive evidence for this coaes from the observation that 
the antibodies described here, which are reactive vith the atripped 
6BX component, can precipitate a molecule of this site froa the in 
▼iiro translation products of cRKA isolated from breast cancer cells 
expressing the auein . 
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As the milk mucin contains at least 50% carbohydrate ( 16 ) ,a 
protein core of only 6 BED appear* too anall if the intact molecule has 
an observed molecular weight greater than 400H5. However, mucins cam 
be cocposed of aaall eubunits which aggregate and are held together by 
soae form of xton-covalent interactional as yet not tznderstood. ?or 
•xaaple, although the molecular weight of the ovine aubmaxillary mucin 
haa been reported to be greater than 1 x 10 6 daltons (45^ it haa a 
protein core of only 650 aaino acida with a molecular weight of 58,300 
daltone (^g ) . 

An unexpected finding was that the antibodies tt£?G-1 and &£?G-2 
which react with the milk mucin, also ahow a positive reaction with 
the extensively stripped material which ahowed no lectin binding 
capability. Previous indirect evidence, including the resistance to 
fixation, boiling and reduction, the repetitive nature of their 
epitopes and the appearance of aeveral bands on imaunoblots , had led 
to the belief that carbohydrate present on the ailk aucin was involved 
in these epitopes. 1Mb idea vas reinforced by the observation that 
lectins could block the bindin£ of H1£FG-1 and 2 (1). While it is not 
possible to exclude the possibility that some augers, undetected by 
the lectin binding experiaents, remain on the extensively stripped 
aucin described here, this la unlikely to be the explanation for the 
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r.actirity of the antibodies B?S-1 and 2. Shi. can be .aid since 
both antibodies have recently been ahowa to react positiv.ly with 
t-galacto.idaa. fusion protein, .xpr.ss.d by phage carrying SKA coding 
for the core protein of the «a»ary a^i* . It appear, therefor, that 
at leut part of .ach of the .pitope. recognised by EH7G-1 and 
contain amino acid, but it must be assumed that .ome of the.e .pitopea 
on the cor. protein are .zpo.ed. i... not maaked in the fully 
glycosylated molecule. The H^G-2 .pitope i. however 1... abundant ea 
the milk mucin than the HKFG-T .pitope, while it i. readily detectable 
cn the mucin molecules .xpre.eed by toour, (1 ) ; n,,,, molecules have 
* amall.r molecular weight and may be 1. 88 heavily glycosylated or 
polymerized. 

Her. we have reported the development of new antibodi.. which ar. 
reaotir. with the protein core of the mucin and with the partially 
deglycoaylated molecule, but which are unreactiv. with the fully 
processed mucin produced by the lactating ma»ary gland. One of these 
antibodies SU-3. which i. an Ig Gl . ha. been .tudi.d In more detail. 
It ha. been ahown to react with the mucin molecules which ar. produced 
by breast cancer cell, and ar. recognised by many antibodi.. developed 
agiinst the intact milk mucin. It .hould b. •x.phasis.d how.v.r that 
th. .pitop. r.cogni..d by SU-3 which i. on th. cor. prot.in and i. 
•zpo..d in th. mucin as processed by tumour c.ll,. i, aot .^.^ „ 
th. normally proe.sa.d milk mucin. Jhis f.atur. offer, th. 
Possibility of .ahane.d tumour specificity, and a pilot 
iinmunohi.tochemical study of breast tumour, and ti.sue. ha. .hown that 
indeed the SK-S antibody react, strongly with the majority of primary 
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breast cancers (91*) but .bows little or no reaction with benign 
breast tumours, resting or laetating breast, and most normal tissues. 

There are several implications of the work described here which 
■ay be important for both basic and clinical studies la breast cancer. 
The ob.enration that parts of the core protein (detectable by 
antibodies) are exposed on the smcins as processed by breast cancer 
but Basked cn the uucin as processed by cells in normal breast and 
benign tumours implies that there is an alteration in the processing 
of the mucin in aalignancy. A Bore detailed study of the processing 
of the mucin in normal and malignant cells may -then give baBic 
information for defining the malignant cell. Moreover, since the 
specificity of the reaction of the antibody Si-3 for tumours is better 
than that of antibodies developed against the intact aucin, this 
antibody may proTe to be a more effective diagnostic tool for the 
detection of breast cancer cells in tissue sections, tissue fluids and 
cells. The reactive components are membrane associated as veil as 
intracellular and in vivo localisation of tumours may also be 
possible. 



She abbreriations wed are: Ek£?G. human aiilk fat globule; PBS, 
phoaphate-buffered ealine (153 a* RaCL, 3 wM ICL, 10 a* Ha.HPO , 2 a* 

^Z* 0 * PH 7,4)1 WG1 ' * h6Rt * ena *«Sl»tinin; FJU, peanut a«lutijiin; 
EPA, Helix pomatia agglutinin; BSA, feoTine aersa albuaia; SDS, aodiua 
dodecyl aulfate. 
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Example 2 

Purification and deglycosylrtion..of hu*nan milk mucin was 
conducted as in Example 1 mucin was purified on an HMFG-1 
antibody. 

Ihe stripped mucin preparations were separated by electrophoresis 
through EaDodSO^/polyacrylaaide eels (10JC) and ailrer stained by two 

* 

methods, one of which can be rued to stain highly glycosylated 
proteins (22.23). 



Preparation of polyclona l rabbit aati serum to stripped core -protein 

One New Zealand fhite rabbit was immunized with 100 jig of the 
partially stripped core protein in complete Freund's adjuvant (Gibco). 
Booster injections of 500 jig of the totally stripped core protein were 
administered in incomplete Freund's adjuvant (Gibco) 3 and 4 weeks 
after the initial injection and the rabbit was bled one week later. 
Ten microliters of immune serum (75 jig/ml protein) precipitated 200 ng 
of fully stripped core protein in a Protein A assay (24) and detected 
it on immunoblote . The immunoglobulin fractions of rabbit preimmune 
and rabbit anti-sueia core protein were prepared by adding ammonium 
aulfate to 50% saturation. The resulting pellet was resuspended in 
one-half the original aerum volume of PBS and dialyxed against the 
same buffer. After dialysis, only residual precipitate was removed by 
eentrifugation. Immunoglobulin fractions we.«e stored in aliquots at 
-20*C. 



Substitute sheet 



WOS8/U5054 PCT/GB88/00011 

■■"*»»■ • \- " .' 

*■ ■ v «■ . • . 

J L 
i 

-37- 

Deacription of KAba mad 

In addition to the polyclonal antiserum. used for initial screenina 

• ^ e* 

a cocktail of two MAbs, SM-3 SHmm4 Jtxanple 1) which recognise the 

mucin core protein (20) and HK7G-1 and HU7G-2 (1.1-4) w«re used to 
acrsen the purified plaques, the 0-galactoaidase fusion proteins and 
for iaaunoprecipitationa from in Titro translated protsina*. Other 
MAbs used were a Monoclonal anti-p^g alactosidase antibody (25) which 
was a gift froa H. Itarbin (ICR?, London), an anti-interf eron antibody, 
ST254 (24), 1261, a keratin antibody (26) and M1 6 which recognises a 
carbohydrate structure on the ailk aucin (27). 

In Titro translation of proteins 

RKA was isolated froa the huaan breast cancer cell line MC7-7 
using the guanidiua iaothiocyanate aethod of Chirgwin at al. (28) and 
poly(A)"*" RKA was purified by chroaatography using oligo (dT)-cellulose 
(Haw England Bio Labs) . The poly(A) + RHA waa translated in a 
reticulocyte lysate systea (Aaershaa) in the presence of [ 55 S] 
aethionine (1000 Ci/c=ole; 1 Ci * 57 GBq, Aaershaa). Saaplea 
containing 5x10* acid insoluble cpa were precipitated in a protein A 
assay (24) using MAbs SU-3, Slf-4, mCTG-1 , H1CTG-2 and a control 
antibody to huaan interferon. Ihe antibody^electad proteins were 
than separated on a 1 0X HaI>odS0 4 /polyaerylaaide gel, iapregnatad with 
Aaplify (Aaershaa) and exposed to XAR-5 fila (Xodak) at -70 *C. 

The MAbs SM-3 and SM-4 (SM r%t^TB to stripped smcin) show strong 
reactirity with the partially and fully stripped core protein but no 
reactivity with the fully glycosylated aucin (20). 
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Antibody screening of l gt11 library and protein blotting 

The igtll library used ia thiB study was constructed fro= aJLKA 
isolated from the human breast cancer cell line MCP-7 and was 
fenerously proyided by Philippe falter and Pierre Chaabon (Strasbourg, 
Prance). The poly (A) 4 EKA used for the preparation of the randoaly 
priaed library na prepared froa aSMA that sediment ed faster than 28S 
rRUA and was enriched ia estrogen receptor (29). The library was made 
essentially as described by Huynh et al. and Toting and Davis (50-32) 
and contained approximately 1 x 10 6 recombinants per |ig of UNA. 
Between 85% and 95% of the plaques contained inserts. 

The phage library was plated onto bacterial strain 11090 and grown 
for 3 hr at 42 *C. After isopropyl P-D-thiogalactoside (IPTG) 
induction and 3 hr of growth at 37 filters were prepared from each 
plate and screened with anti-mucin core protein antibody by the method 
of loung and Davis (32). The first antibody used in screening vas the 
rabbit antiserum raised againBt the stripped core protein prepared as 
described above. Prior to use in screening, the antiserum vas diluted 
1:200 in PBS containing 1% bovine serum albumin (PBS/BSA) . 
Preabsorption with 11080 bacterial lysate was not found to be 
necessary. The nitrocellulose filters (Schleicher and Schuell) were 
blocked by incubation in PBS containing 5% BSA for 1 hr at room 
temperature with gentle agitation. The filters were incubated at room 
temperature overnight with a 1:200 dilution of antiserum ia heat 
scaled plastic bags. The filters were washed 5x5 min ia PBS/BSA, 
and bound antibody was detected by using horseradish 

peroxidase-eonjugated sheep anti-rabbit antiserum (Dako) diluted 1:500 • 
with PBS/BSA and incubated for 2 hr with the filters. The filters 
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were washed 5 




cin in PBS/BSA and 1x10 ain In P3S before color 



detection using 4-chloro-1 -oaphthol (1), Inaunoreactive bacteriophage 



were picked and purified through two additional rounds of screening . 

Subeequently, bacteriophage Inserts were subcloned into the EcoRI 

• ites of pDCB (33) producing the plaanid used aost eztensively, pKUC 

10. The plassids were maintained in SUi cells. 

To az amine the p-galactosidaae-cDNA fusion proteins for 

iicsranoreactivity, cell lysates were derived, lysogens were prepared 
as described in Toung and Davis (34). Cells were pelleted, suspended 
in Lae=li saaple buffer (35) and separated by electrophoresis through 
HaDodSO^/polyacrylaaide gels (10%) and transferred onto nitrocellulose 
filters as described (1,36). The filters were treated as above for 
antibody screening. 

Korthern Analysis 

RNA was isolated fron tissue culture cells and frozen tissues by 
the euMidiniun isothiocyanate method of Chirgwin at al. (28). Total 
RtfA (10 jig per lane) was denatured by heating at 55 # C for 1 hr in 
deionized glyozal and fractionated by electrophoresis through a 1.3% 
glyozal gel (38). de RNA was transferred to nitrocellulose 
(Schleicher and Schuell), prehybridized and hybridized as described by 
Thomaa (34). ?iltera were washed down to 0.1X SSC with 0.1% SDS at 
65*C and exposed to XAK-5 film (lodak) at -70 # C with intensifying 



screens . 
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Southern analysis 

High molecular weight genomic DNA was prepared from white blood 
cells and cell lines (59,40)* These genomic DNAs (10jig) were cleaved 
with restriction enzymes following the manufacturer's recommended 
conditions and fractioned through 0.6% and 0.7% agarose gels. Cloned 
plaamid DNA wafl cleaved and fractionated on 1.3JJ agarose. The gels 
were denatured, neutralized and transferred to nylon membranes 
(Biodyne) according to the manufacturer's instructions. The EcoR1 

insert from plTJCl 0 was separated on a 1J low melting point agarose 
(Biorad) gel and labelled with [a- 32 P]dCTP by the method of random 

priming (41) and hybridized to filters at 42*C. Pilters were washed 

down to 0.11 SSC with 0.T % SDS at 55*C and exposed to 1AR-5 

(Kodak) at -70 # C with intensifying screens. 



Results 



Purification and deglvcoBvlation of mucin glycoprotein 

Ifucin glycoprotein reactive witii the monoclonal antibody HLEFG-1 
was prepared from pooled human breast milk by using an HVEFG-1 antibody 
affinity column, followed by. molecular sieve chromatography on 
Sephadex G-75 in order to remove lower molecular weight components 
(Figure 7, lane 1). In order to demonstrate the homogeneity of the 
purified molecule, amino acid analyses of four separate preparations 
were performed and revealed a fairly consistent composition with 
serine, threonine, proline, alanine and glycine accounting for 58* of 



the a=ino acids ( Periodic acid silver stain d gels 

revealed a diffuse band of greater than 400,000 daltons risible only 
when the gel was treated with periodic acid before the silver stain 
(Fig. 7 f lane 2)* Mo other lover molecular weight bands wer 
wisualixed on the gel using the silver stain without prior treatment 
with periodic acid. 

The purified material was subjected to treataent with hydrogen 
fluoride to remove the O-linked sugars that are characteristic of 
mucin glycoproteins. Two different reaction conditions were need 
which resulted in a partially deglycosylated core protein (treated at 
0 # C for 1 hr) and a fully deglycosylated core protein (treated at roon 
temperature for 3 hr) as determined by iodinated lectin binding 
following separation by gel electrophoresis and transfer to 
nitrocellulose paper (20). The partially deglycosylated core protein 
was reactive with wheat germ agglutinin, peanut agglutinin and helix 
pommatia lectin (which recognises the linkage sugar 
N-acetylgalactosamine) whereas the fully stripped protein showed no 
reactivity with any of these three lectins. 

The hydrogen fluoride treated core protein was separated by 
electrophoresis through KaDodSO^/polyacrylamide gels (10%) and silver 
stained. Silver staining revealed that the predominant component of 

* * 

the partially stripped mucin was a high molecular weight band of about 
400 kd, although faint bands of lower molecular weight could also be 
observed (Fig.B , lane 1). Since the high molecular weight material 
showed a somewhat increased mobility in the gel t ^rl reacted with the 
lectin recognising the linkage sugar, it can be assumed that some 
sugars had been removed. The fully stripped mucin consisted of two 
bandB of about 68 kd and 72 kd (Fig. 8 f lane 2). 

******* s 
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Antibody reactive proteins produced by MC7-7 cellB 

The MCF-7 breast cancer cell line expresses large amounts of 
H1£FG-1 and -2 reactire material on its cell surface (14) and was thus 
judged to be a suitable source of slRXA for a cDNA library. Before 
proceeding to screen the MC7-7 library with the monoclonal antibodies 9 
they were tested for their ability to precipitate a component from in 
vitro translation products produced from MCF-7 mRXA. Poly (A) + RKA 
from MCP-7 was prepared and translated in vitro . Proteins from the 
translation reaction vere immunoprecipitated using the monoclonal 
antibodies K£FG-1 1 K£PG-2, SU-3 and and displayed by 

poly acryl amide gel electrophoresis and fluorography (Fig, 9). Two 
proteins of about 66 kd and 92 kd were immunoprecipitated by SM-3 
(lane 2) and SM-4 (lane 1). It was also found that HMFG-1 (lane 4) 
and HMFG-2 (lane 3) immunoprecipitated these proteins; however, no 
bands in these areas were precipitated by an irrelevant monoclonal 
antibody to human interferon (lane 5). The fact that H£FG-1 and -2 
immunoprecipitated these proteins was an unexpected finding as it was 
previously thought that these HAbs recognize carbohydrate 
determinants (1). However, we also found that HUFG-1 and -2 react 
very strongly with the fully stripped, iodinatsd core protein (20). 
These results together with the UAb reactions on the p~g alactosidase 
fusion proteins (see below) confirm that the epitopes for H3J7G-1 and 
-2 are, at least in part, protein in nature • 

The abundance of the eore protein mRXA in total cellular poly (A)* 
RNA was 4% as stimated by omparing the amount of ('^S)methionine 
present as immunoprecipitated protein to the amount of methionine 
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incorporated into total protein during in vitro translation. 



Screening of the cDKA library 

The JLgtH cDNA library made from Bite selected MC7-7 mRXA (see 
Methods) was screened initially with the polyclonal antiserua made to 
the mucin core protein which had been stripped of its carbohydrate. 
Screening of 2 x 10 6 plaques resulted in 11 positive clones, 7 of 
which were taken successfully through two further rounds of plaque 
purification. 



To demonstrate that the reactivity of the phage cloneB with th 



e 



antibody probes was due to antigenic determinants on the 



cDNA 



translation product, P-galactosidase fusion proteins were made froa 
all 7 clones. The proteins were separated by electrophoresis, 
transferred to nitrocellulose paper and probed with a variety of 
antibodies to the stripped mucin, including the polyclonal antiserua 
which was used initially to select the clones and a cocktail o* 
and SM-4. In addition, and H£PG-2, the two monoclonal 

antibodies which originally detected this differentiation and 



tumour-associated epithelial mucin (1,14) were tested. All 7 cl 



ones 



yielded fusion proteins which were specifically recognized by the 
polyclonal antiserum, the monoclonal cocktail, and Hk£7G-2. H1£?G-1 
antibody reacted with 6 of the 7 fusion proteins and failed to 
recognise the protein from clone 9 which contains the smallest insert 
In ©very case the strongest signal was given by the B£?G-2 antibody 
and this reaction is shown in Figure 10. Monoclonal antibodies to 
keratins and to a carbohydrate epitope on this fully glycosylated 
mucin were used as controls and showed no reactivity 
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A monoclonal antibody to p-galactosidase was a positive control and 
the band recognized correlated In very case with the band recognized 
by the specific antibodies. The sizes of the fusion proteins varied 
in proportion to the sites of the cDNA inserts found in the 
bacteriophage . 



Characterization of cDKAs and RKA blot analysis 

The inserts from the X clones were designated plfUC3-1 0 (omitting 
plfUCS) and were subcloned into the vector pUC B for easier 
manipulation. The 7 clones were compared to each" other for sequence 
homology. Each of the plasaids was digested with EcoRI and the insert 
separated on a 1.4£ agarose gel. The largest cDNA insert from pMUClO 
was used to probe the inserts and found to hybridize to all 6 inserts 
(Pig. 11 )* pMUC 7 was found to contain two inserts following digestion 
with EcoRI; however, only 1 of the inserts hybridized to the ptfUCIO 
probe. The insert bands were not derived from phage DNA since the 
pMUCIO probe did not hybridize to Hind Ill-digested X phage DNA 

As shown by agarose gel electrophoresis (Pig ,11), the 

m 

inserts vary in size from about 200 to up to about 1800 bp. The 
largest insert from pl£UC10 has been used as the hybridization probe in 
all subsequent experiments • 

Because the UfUC clones were identified only by. antibody binding, 
we needed additional assurance that they were indeed coding for the 
breast epithelial mucin « To determine the authenticity of pHUC10, we 
correlated the presence of mRNA hybridizing to the clone with siucin 
expression in various cell lines. As shown in figure the cDNA 
hybridized to two transcripts of 4.7 kb and 6.4 kb in the RNA from the 
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breast cancer cell lines MCF-7 and T47D which were shown previously to 
express the antigen (1,14). Significantly, the pWUClO probe 

hybridized, to transcripts of approximately the loe size ia MA 
•xtracted from normal mammary epithelial cells cultured from milk 
(42). A third band of 5.7kb can be seen in the UNA from these normal 
cells. In contrast, three human cell types that lack the miein, 
breast fibroblasts, Saudi cells and HS578T, a carcinosarcoma line 
derived from breast tissue (43), ahowed no detectable pVttJCI 0-related 
&RKA. The 6.4 kb band appears to be the most adundantly expressed. 
The presence of at least two sizes of mENA f rom 4JCF-7 cells correlates 
with the inmunoprecipitation of two proteins of (molecular weights 68 
kd and 92 kd) from in vitro translated ttKNA from MCF-7 cells. The 
normal mammary epithelial cells were derived from pooled milk samples 
and the additional transcript observed may be due to polymorphisms 
among individuals. 




length polymorphism (BFLP) 

Genomic DtfA was prepared from a panel of ten individuals 
consisting of six unrelated individuals and a family of four, and from 
three cell lines. The TO As which were digested with Hinfl or EcoRI 
and blotted and hybridized to the radiolahelled plTOCIO insert, sxhibit 
restriction fragment length polyaorphiams. The restriction fragments 
from the ten individuals and three cell lines are shown in figure 13. 
She pattern consists of either a single band. or a doublet of sizes 
ranging from 3400bp to 6200bp in the Hinfl digest (with the exception 
of the ZR75-1 DNA in lane 12, figure 13a which shows three bands) or 
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from B200bp to 9600bp in the EcoRI digest (figure 13 B ). There appearB 
to be a continuous distribution of the fragment sizes which implies a 
high in vivo instability at the locus. Bie pattern of fragments 
observed in the family of four (lanes 1-4) suggests that these 
fragments are allelic. Preliminary studies of the SKA made from white 
blood cells of normal, related indiriduals indicate the sxistance of a 
number of independent alleles with an autosomal codominant mode of 
inheritance IheBe 
studies will be the subject of a separate investigation. 



The cDNA clones described here which were obtained from the KCF-7 
Igtn library were selected using polyclonal and monoclonal antibodies 
prepared against a normal cellular product, the milk mucin in itB 
deglycosylated form. This was done because it was easier to obtain 
large quantities of the mucin for stripping than to. prepare similar 
quantities of immunologically related glycoproteins expressed by 
breast cancer cells (44). The fact that the antibodies did select for 
cDNA coding for nonglycosylated core protein molecules in MCP-7 cells, 
strongly suggests that the glycoproteins in these cells, which were 
originally detected by their reaction with antibodies to the milk 
mucin, contain the same core protein as this mucin. This is confirmed 
by the detection of stftHAa of approximately the same sizes in the 
normal and malignant cells, using one of the probes isolated from the 
MCF-7 library. We will therefore refer to the antibody reactiY 
glycoproteins en breast cancer cells as mucins, bearing in mind that 
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their processing may be different resulting in moleculeB of different 
nolecular weigbtB but with' the saae core protein as that of the oilk 
mucin . 

Seven clones were obtained from the MC7-7 library of which the 
largest was 1800rb. This clone cross hybridized with the other 6 
•nailer clones. The p-galactosidaae fusion proteins expressed by six 
of the cross-hybridizing laabda clones were reactive with the 
polyclonal antiserum directed against the mucin core protein as well 
as with four well-characterized monoclonal antibodies directed to 
various epitopes on the stripped core protein, SM-3, SM-4, KJ£?G-1 and 

(14,20). The Baallest laabda clone, 11TJC9, produced a 
p-galactosidase fusion protein which reacted with three of the four 
monoclonal antibodies and with the polyclonal antiserum. 

. The surprising result that the extensively characterized ffl£FG-1 
and HSQ>G-2 monoclonal antibodies reacted strongly with the laabda 
plaques and the fusion proteins and could inaunoprecipitate proteins 
froa in vitro translated sKKA provides strong evidence that these 
clones do indeed code for a portion of the mucin core protein. 
Although previous evidence such as resistance to fixation, boiling, 
treatment with dithiothreitol and XaDodSO and the presence of 
multiple epitopes on the molecule suggested that these were 
carbohydrate (1), it has now bees established that the epitopes of the 
B£P0-1 and HUPG-2 monoclonal antibodies are definitely protein in 
nature. Carbohydrate may be required to obtain the. strongest binding, 
either as part of the epitope or by conferring some conformational 
change on the protein portion, but part of the antigenic determinant 
must consist of an aaino acid sequence. Since these two KAbs are 
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reactive with the fully glycosylated Bilk aucin as well as the 
•tripped core protein, this data aeanB that the intact molecule 

■ 

contains areas of naked peptide which contribute to the antigenic 
aites for these two antibodies. 

Confirmatory evidence that pMUCIO codes for the maaaary aucin core 
protein is provided by RXA blots. She relative abundance of mRXA in 
the breast cancer cell lines MC7-7, T47D, ZR-75-1 and in normal 
. as-naary epithelial cells corresponds to the antigen expression by 
these cells as measured by the binding of the HUPG-1 and H£PG-2 
monoclonal antibodies. Cell types which are negative for antigen 
expression such as human fibroblaBtB, Daudi cells and HS578T, a 
carcinosarcoma line derived froa breast (14), are negative in RNA blot 
hybridizations. A fortuituouB observation made with the ZR-75-1 
cells yielded indirect strong evidence that plTUCIO does indeed code 
for the mucin glycoprotein core protein. This cell line, which 
routinely expresses large amounts both of aRNA and antigen, yielded 
one preparation of RHA which was unexpectedly negative by blot 
hybridization. It was subsequently found that those particular 
ZR-75-1 cells froa which the RiJA had been aade had lost the expression 
of the antigen as well at this tiae (as determined by reaction with 
HHTG-1 and 2). Different passage number, of the ZR-75-1 cells were 
recovered and shown once again to express both antigen and message. 
Ihe sites of the messages, 4.7 kb and 6.4 kb, are quite large, «ince a 
68 kd or 92 Id protein would need only about S kb to code for the 
protein portion.. Thi. sucgeati that a lar*e portion of the aRXA 

aaybe mtranolated. Efforts are underway to obtain a full-length 
clone . 
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Thus, the eDNA donee presented here represent a portion of the 
gene coding for the hunan Ba=oa.ry auein which is •xpressed by 
differentiated breaBt tissue as well as by most breast cancers. The 
major proteins precipitated froa in vitro translation products of RKA 
froa MC?-7 cells by antibodies to the milk mucin core protein (6SId) 
hare an apparent aolecular weight of 68Xd and SZLA. These proteinB, 
produced by the breast cancer cell therefore share epitopes with the 
68Xd core protein of the milk aucin (20). Ihether a aiailar 92Xd 
protein is also produced by noraal aaaaary epithelial cells, and is 
truncated or destroyed by KF treataent is not yet clear. KCF-7 cells 
biosynthetieally labelled with 14C aaino acids yield upon 
iaaunoprecipitation with IQ£?G-1 and H1CG-2 antibodies, two 
glycosylated proteinB of 320 kd and 430 kd, and £ t 

is possible that each of these glycoproteins utilises only one core 
protein of either 6BZd or 92Zd. Alternatively, each of the 
glycoproteins could contain both the 92£d and 68Sd proteins either in 
different proportions or variably glycosylated. Further screening of 
the library aay yield full length eDNAs coding for both sizes of the 
iaaunologically related core proteins. Since there appears to be only 
a aingle gene (based on Southern blot data obtained by using a partial 
eDNA probe), it is probable that the multiple messages arise by 
alternative RKA splicing and this would explain the fact that they 
contain coaaon sequences. Although a core protein of 68 kd appears to 
be saall to yield a fully glycosylated molecule of greater than 500 kd 
which contains 50% carbohydrate, there is evidence that aueh a 
■tructure for mucins is possible. Ovine subnaxillary mucin has a 
reported molecular weight of 1 x 10 6 daltons (45), yet its protein 
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eore consists of 650 amino acids resulting in a molecule of 58 kd 
(46). 

The mucins which are detected with HVE7G-1 and HM7G-2 MAbs on 
Ismunoblots of tumours and breast cancer cell linea .how variations in 
• ize from 60 kd to 400 kd in the molecular weights of the tumour mucin 
molecule* (1,47). Using theie >aae antibodies which detect high 
molecular weight mucins present in normal urine, a polymorphism has 
Indeed been ahovn to be genetically determined (48). Although the 
very low molecular weight components are likely to represent precuraor 
forms of the mucin which appears to be incompletely processed in many 
tumour cells (20), the variations in the higher molecular weight 
components are likely to be due to this genetic polymorphism. It was 
unclear, however, whether the structural basis of the polymorphism was 
due to the genetically determined protein or to the carbohydrate 
portion of the mucin. Ihe detection of restriction fragment length " 
polymorphisms in the Southern blotting experiments using the mucin 
probe auggest that the mucin polymorphism occurs at the level of the 
BKA which codes for the protein. Preliminary sequence data 
AugeeBt that the basis for this polymorphism is a region 
of variable tandem repeats present in the protein eoding sequences. 
Ibis atructural feature may be responsible for the generation of the 
■any allelic restriction fragments at the mucin locua. We are 
presently investigating the basis of the mucin polymorphism by a 
Southern blot «urvey of DHA from white blood cells of normal, related 
individuals whose inheritance pattern of urinary mucins has been 
determined. In addition, we are examining DNA preparations made from 
the white blood cells and tumours of individual breast cancer patients. 
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to determine if there is any discordance between genotype in the . 
paired samples, since tandemly repeated PNA may provide .an unstable 
aite »here recombination or amplification could occur. 

The presence of nucins in the majority of carcinomas and their 
association *ith the differentiation of mammary epithelial cella makes 
it particularly important to identify regions involved in the tissue 
specific and developmental regulation of the gene. Moreover, the 
introduction of a functional mucin gene into cells should provide 
insights into the role of this molecule in breast epithelial 
differentiation and possibly enable ub to identify any alterations in 
the function or expression of the mucin which are related to malignant 
transformation in the human breast. 
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Abbreviations 



The abbreviations are as follows: PBS. phosphate-buffered aaline; 
MAb. monoclonal antibody ;1F7G, i.opropyl 0-D-thio£alacto.ide ; bp, 
base pair(s); Kb, kilobaae(s). 
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TABLE 1 



eino acid cocpoeltion »<> the h M ., ~«-.v - lMn _ 



eoerpftrlaon »itb PAg_n 



Anino acid 



HHFO-1 jmrifi.d XxtenaiTely .tripped 



Asp 


6.1 


7.2 


Thr 


9.4 


9.7 


Ser 


9.1 


13.0 


Glz 


6.3 


9.6 


Pro 


14.8 


14.4 


Gly 


8.1 


10.1 


Ala 


12. S 


11.9 


Cys 


Hot analysed 


Hot analysed 


Val 


6.0 


6.3 


Vet 


0.5 


0.4 


XLe 


1.6 


1.7 


Leu 


4.5 


4.8 


Tyr 


2.0 


0.9 


Pie 


2.0 


1.6 


Hie 


3.2 


2.S 




2.8 


3.5 


Al"£ 


4.0 


4.0 



(Eb. -iru & Xaaauchi 
1952) 




6.4 
9.8 

13.1 
8. 3 
12.0 
12.2 
13.0 

0.5 

5.3 

0.6 

1.9 

3.7 

1.6 

1.7 

8.8 

£.2 

S o 9 
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Table g 

Keactlrity of tb, a^lbortU, nn l Bt , et . „ T 



125 



X Cpo bound 



Intibody 



Intact aiolecul 



Totally 

itripped maetn. stripped srocin 




5.17 


8,524 


11,925 


5,780 


8.13 


525 


3,000 


3,328 




465 


15,414 


9,200 


SU-4 


616 


16,750 


9,561 


KUPG-i 


32, 000 


33,768 


9,494 


Hl£?G-2 


29,500 


29,230 


15.832 


NS2 aediua 


397 


845 


650 



The binding of the antibodies to iodiaafd intact, partially and 
totally de,lyco.ylated «ilk macin was aa.ay.d win* the protein A 
plate method as deacribed in Material! and Methods. 



SUBSTITUTE SHEET 



WQ 88/05054 -f:l) PCT/GB88/0001I 

v . -55- • ' ~ 

Reference e 

1. Burchell, J.M., Durbin, H. and Taylor-Papadimitriou, J. (19S3) J. 
Zssunol. 131 , 508-513. 

2. Braawell, M.E., Ehavanandan, T.F. , Wiseman, G. and Harris, H. 
(1583) Br. J. Cancer 48, 177-183. 

3. Mcllhinney, R.A. , Patel, S. and Gore, M.E. (1985) Biochem. J. 227, 
155-162. 

4. Hilkens, J., Buijs. F. f Hilgers, J. t Hageaami, Fh. f Calafat, J. f 
Sonnenberg, A. and Van der Talk, (1984) Int. J. Cancer 34, 
197-206. 

5. Tagliabue, E. , Porro, G. . Barbanti, P., Delia-Torre, G. , Menard, 
S., Rilke, P., Cerasoli, S. and Colnaghi, M. (1986) Hybridona 5, 
107-115. 

6. Johnson, T.G., Schlom, J. f Paterson, A.J., Bennett, J. t itagnani , 
J.L. and Colcher, D. (1986) Cancer Res. 46, 850-857. 

7. Sekine, H. , Ohno, I. and Xuf e , D.W. (1 985) J. Iimunol. 135, 
5610-3 615. 

8. Onnerod, M.G., Steele, I. 1 Edwards, P.A.W. and 
Taylor-Papadinitriou, J. (1984) J. Expt. Path. 1, 263-271. 

9. WilkLnaon, M.J., Howell, JL., Harris, J., Taylor-Papadioitriou, J., 
Swindell, R. and Bellwood, R.A. (1884) Int. J. Cancer S3, 299-304. 

■ 

10. Xiife, D. . Inghirami, G. , Abe, U. f Hayes, D. t Justi-Wheeler, H. and 
Schloa, J. (19&4) Hybridoaa 3, 223-232. 

11. Price, M.B., Edwards, S., Robing, R.A., Hilgens , J. , Hilkens, J. 
and Baldwin, R. (1986) Eur. J. Can. Clin. Oncol. 22, 115-117. 

12. Johnston, W.W. , Sxpak, C.A., Lottick, S.C., Thor, A. and Schlo=, 
J. (1985) Cancer Res. 45, 1894-1900. 



13. Rassussen. B.B.. Pedersen, B.V., Thorpe, S.M., Hilkens. j 
Hilgers, J. and Roso, C. (1986) Cancer Res. 45, 1424-1427. 

14. Taylor-Papadiaitriou. J. , Peterson, J.A., Arklie. J. f Burchell, 
J., Ceriani. R.L. and Bodaer, W.P. (1981 > Int. J. Cancer 28, 
17-21. 

15. Burchell, J. , Wang. D. and Taylor-Papadiaitriou, -J. (1984) Int. J. 
Cancer 34, 763-768. 

16. Shiatsu, M. and Xaaauchi, X. (1982) J. Bioehea. 91, 515-519. 

17. Oraerod, U.G., Steele, X. , Westwood, J.H. and Kaxzini, li.K. (1983) 
Br. J. Cancer 48, 533-541. 

18. Taylor-Papadiaitriou, J. , lane, E.B. and Chang, S.E. (1983) In, 
Understanding Breast Cancer: Clinical and Laboratory Concepts. 
(Rich, U.A., Hager, J.C. and Puraanski , P. Eds.) Marcel Dekker, 
Inc. New Xork and Basel, pp. 215-246. 

.19. Chang, S.E. and Taylor-Papadiaitriou, J. (1983) Cell Diff. 12, 
143-154. 

20. Exanple 1' above. 



21. Mort, A. and Lamport, D. (1977) Anal. Biozham^ 82,. 289-J09.' 

22. Wray, W., Boulikw, T. f Wray, X.P. and Hancock, R. (1981) Anal. 
Biochea. 118, 197-203. 

23. Dubray, 0. and Bezard. 0. (1982) Anal. Biochem. 119, 325-329. 

24. Shearer, H. , Taylor-Papadiaitriou , J., Griffin, D. ^ BalWill, 
P. (1984) J. Immunol. 133, 5096-3101. 

25. Durbin, H. and Bodaer, W.P. (1986) J. Immunol. Math., In Press. 

26. Lane, Z.B. (1982) J. Cell Biol. 92, 665-673. 



SUBSTITUTE SHEET 



WO 88/05054 ^ ^ PCT/GB88/00011 





27. Gooi, H.C., Jones, K.J. , Hounsell, E.P., Scudder, P., Hilkena , J. , 
Hilgere. J. and Peizi, T. (1985) Biochea. Biopfaya. Res. Coeaun. 
131, 543-550. 

i 

28. Chirgwin, J.li., Prxybyla, I.E., kaeDonald, R.J. and Butter, W.J. 
(1979) Bioehem. 18, 5194-5199. 

29. Waiter, P., Green, S., Greene, G. , Xruat, A., Boraert. J.-M., 
Heltseh, J.-M., Staub, A., Jensen, Scrace, G. , Waterfield, ii. 
and Chaabon, P. (1985) Proe. Katl. Acad. Eci. USA. 82, 7889-7893. 

3 0. Huynh, T.V., Toung, R.A. and Daris, R.W. (1985) In, DNA Cloning: A 
Practical Approach, ed. Clover, D.li. (IRL. Oxford), Vol. 1, pp. 
98-121, 

31. loung, R.A. and Davis, R.W. (1983) Proc. Katl. Acad. Sci. USA 80, 
1194-1198. 

32. loung, R.A. and DaviB, R.W. (1983) Science 222, 778-782. 

33. Vieira, J, and Messing, J. (1982) Gene 19, 259-268. 

34. Toung, R.A. and Davis, R.W. (1985) In Genetic iiigineering , eds. 
Setlow, J.I. and Hollaender, A. (Planus, New lork) , Tol. 7. 

35. lAensali, U.K. (1970) Nature (Lond) 227, 680-685.' 

36. Towbin, H. f Staehelin, I. and Gordon, J. (1979) Proc. Natl. Acad. 
Sci. USA. 76, 4350-4354. 

37. Maniatie, T. , Pritsch, Z.P. and Saabrook, J. (1982) Molecular 
Cloning: A Laboratory Manual (Coldspring Harbor Laboratory, Cold 
Spring Harbor, BT). 

38. Thomas, P.A (1980) Proc. Katl. Acad. Eci. USA. 77, 5201-5205. 

>ir«do, H., Langdale, J. and 

Malcolm, A.D.B. (1985) In Huoan Genetic Diseases, ed. Davies, I.E. 
(IRL, Oxford), p. 56. 



WO- 158/05054 




- 58 




PCT/GB88/00011 



40. 01d f J.M. (1985) In Human Genetic Diseases, Ed. Davies, I.E., 
(IRL, Oxford), p. 4. 

41. Peinberg, A. P. and Yogelstein, B. (1984) Anal. Biochea. 157, 
266-267. 

42. Taylor-Papadimitriou, J. f Purkis, P. and Fentioan, I.S. (1880) J. 
Call. Physiol. 102, 317-321. 

43. Hackett, A.J., S&ith, H.S., Springer, Z.L. , Ovens, R.B., 
Nelson-Rees, W.A., Riggs, J.L. and Gardner, H.B. (1977) J. Katl. 
Cancer Inst. 58, 1795-1 BOO. 

44. Griffiths, A.B., Burchell, J., Taylor-Papadimitriou, J., Gendler, 
S.J., Lewie, A. and Tilly, R. (1987) Int. J. Cancer (submitted). 

45. Gottschalk, A., Bhargava, S. and Mury, T. In Gottschalk, A. (ed) 
Glycoproteins their composition, structure and function, pp. 



810-829, Elsevier, New lork. 

46. Hill, K.D.Jr., Reynolds, J. A. and Hill, R. (1977) J. Biol. Chem. 
252, 3791-3798. 

47. Taylor-Papadimitriou, J. , Millis, R. , Burchell, J., Nash, R. , 
Pang, L. and Gilbert, J. (1986) J. Erpt. Path. 2, 247-260. ' 



48. Swallow, D. f Griffiths, B., Bramwell, U. , Wiseman, G. and 
Burchell, J. (1986) Disease Markers 3, In Press. 



SUBSTITUTE SHEET 



• 59 - 



t 1 « 




v 

49. Abe, V. and Xuf • , D. Effects of aatnrational agents on expression 
and •ecretion of two partially characterized high aoleeular weight 
ailk-related glycoproteins is HC7-7 breast eareinoaa cells. J. 
Cell. Physiol. 126:126-136, 1986* 

* 

50. Arklie , J., Taylor-Papadiaitriu, J #9 Bodaer. W.?. t Zgan, M* and 
Millie, B. Differentiation antigens expressed by epithelial cells 
in the lactating breast are also detectable in breast cancers* 

Int. J. Cancer 28:23-29, 1981, 

51* Bolton, A.Z. and Hunter, Y.U. The lab tiling of proteins to high 

125 

specific radiosctiTities by conjugation to a I containing 
acylating agent. fiiochea # J. 133:529-538, 1973. 



52* Zarlsson, S., Svallov, D. t Griffiths, B., Coney, G. and 

lioplinaon, P. A genetic polyaorphise of a huaan urinary aucin. 
Ann. Hua* Genet. 47:263-25 9, 1983* 

53* Kearney, J*?., Radbruch, A., Liesegang, B. and Kajevsky, X. 1st 
aonse nyeloaa cell line that has lost iDnnoglebulin expression 
but peraits the construction of antibody-aecreting hybrid cell 
lines. J. Isanmol. 125:1548-1550, 1979. 



54. Helero, J. and Gonzales -Rodrigues, J. Preparation of a on o clonal 
antibodies against glycoproteins 111a of huaan platelets. Xur. J 
Biochea. 141:421-427. 1984. 

55. ciaap, J.B., Allen, A*, Gibbons, R.A. and Roberts, G. Chemical 
aspects of accins. Br. Mod. Bull. 34:25-41, 1978. 



